The severe acute respiratory syndrome coronavirus (SARS-CoV) nonstructural proteins nsp1 to nsp16 have
been implicated by genetic analysis in the assembly of a functional replication/transcription complex. We
report the crystal structure of nsp10 from SARS-CoV at 2.1-Å resolution. The nsp10 structure has a novel fold,
and 12 identical subunits assemble to form a unique spherical dodecameric architecture. Two zinc fingers have
been identified from the nsp10 monomer structure with the sequence motifs C-(X)2-C-(X)5-H-(X)6-C and
C-(X)2-C-(X)7-C-(X)-C. The nsp10 crystal structure is the first of a new class of zinc finger protein
three-dimensional structures to be revealed experimentally. The zinc finger sequence motifs are conserved among all
three coronavirus antigenic groups, implicating an essential function for nsp10 in all coronaviruses. Based on 
the structure, we propose that nsp10 is a transcription factor for coronavirus replication/transcription. 


Coronaviruses (CoVs) are positive-stranded RNA viruses 
known to cause highly prevalent respiratory and enteric dis- 
eases in humans and animals. About two-thirds of their ge- 
nomes encode the viral replicase that mediates viral RNA 
synthesis (19). The replicase gene comprises two open
reading frames at the 5' end of the genome, termed ORF1a
and ORF1b (26). The upstream ORF1a encodes polyprotein
pp1a (450 to 500 kDa), while ORF1a and ORF1b together
encode pp1ab (750 to 800 kDa). Expression of the ORF1b-encoded
half of pp1ab requires a -1 ribosomal frameshift
during translation. The polyproteins pp1a and pp1ab undergo
extensive proteolytic processing by viral proteases to produce 
16 functional subunits known as nonstructural proteins, which 
then assemble to form the replicase complex required for viral 
replication and transcription. 

Prior to the global severe acute respiratory syndrome 
(SARS) outbreak in 2003, scant attention was paid to corona- 
viruses by researchers, as this genus of viruses causes severe 
diseases predominantly in animals and only comparatively mild 
diseases in humans. In the wake of the SARS outbreak, greater 
attention has been focused on the replicase proteins with a
view to understanding the replication/transcription machinery
and identifying new therapeutic targets. To date, the
three-dimensional structures of a series of nsp proteins have been
reported (1). nsp5, also called the main protease or 3C-like 
protease, was the first SARS protein structure determined in 
2003 (24) and has since been the focus of concerted efforts for 
the design of antiviral inhibitors. Last year, a broad-spectrum 
inhibitor was reported with efficient in vitro inactivation of 
multiple coronavirus main proteases, potent antiviral activity,
and extremely low cellular toxicity (23). The structure of nsp9 
was determined in 2004, and it was found to be a single- 
stranded RNA binding protein (4, 17). More recently, the 
complex structure between nsp7 and nsp8 revealed a hexa- 
decameric assembly that should constitute a processivity factor 
for nsp12, an RNA-dependent RNA polymerase (25). Also 
determined recently was the structure of the ADP-ribose 1″-phosphatase
domain of nsp3 (14).

nsp10 and nsp11 immediately precede nsp12 in the pp1ab
sequence and have been implicated in RNA synthesis, although
the mechanism remains to be elucidated through both
structural and functional studies. To improve our understanding of
the functions of CoV nsp proteins, we carried out systematic
structural studies on this group of proteins. We report the crystal
structure of nsp10. This protein appears to have a novel fold,
featuring two zinc fingers with C-(X)2-C-(X)5-H-(X)6-C and
C-(X)2-C-(X)7-C-(X)-C motifs. Furthermore, 12 identical
subunits assemble into a novel spherical dodecameric architecture,
which is proposed to be a functional form of nsp10.


MATERIALS AND METHODS 


Protein expression, purification, and characterization. The coding sequence 
for SARS-CoV nsp10-nsp11 was amplified by PCR from the SARS-CoV BJ01 
strain (corresponding to Ala4231 to Val4382 of the ORF1a polyprotein) and
inserted into pGEX-6p-1 plasmid DNA (Amersham Biosciences) using BamHI
and XhoI sites. The forward and reverse PCR primers used for amplification
were F'-nsp10-nsp11 (5'-CGGGATCCGCTGGAAATGCTACAGAAGT-3')
and R'-nsp10-nsp11 (5'-CGCTCGAGTTACACCGCAAACCCGTTTA-3'), respectively.
The protein was expressed in Escherichia coli strain BL21(DE3) as a
glutathione S-transferase (GST) fusion protein and purified using glutathione
affinity chromatography. The GST tag was removed by PreScission protease
(Amersham Biosciences), leaving five additional residues (GPLGS) at the N
terminus. The protein was further purified on a G25 desalting column in 2 mM
dithiothreitol (DTT) and concentrated to 10 mg ml⁻¹ in preparation for
crystallization screening.
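
As an illustrative check of the cloning design described above, the following minimal sketch locates the BamHI and XhoI recognition sequences in the two primers and reads the codons that flank them. The primer sequences are taken from the text; the helper names (reverse_complement, find_site) are hypothetical and are not part of any published protocol.

```python
# Primer sequences as printed in the text (5' to 3').
FORWARD = "CGGGATCCGCTGGAAATGCTACAGAAGT"
REVERSE = "CGCTCGAGTTACACCGCAAACCCGTTTA"

# Recognition sequences of the two cloning enzymes.
SITES = {"BamHI": "GGATCC", "XhoI": "CTCGAG"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (hypothetical helper)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq))


def find_site(primer, enzyme):
    """Return the 0-based position of the enzyme site in the primer, or -1."""
    return primer.find(SITES[enzyme])


bamhi_pos = find_site(FORWARD, "BamHI")
xhoi_pos = find_site(REVERSE, "XhoI")

# Gene-specific part of the forward primer, read on the sense strand.
forward_tail = FORWARD[bamhi_pos + len(SITES["BamHI"]):]

# Gene-specific part of the reverse primer, converted to the sense strand.
reverse_tail_sense = reverse_complement(REVERSE[xhoi_pos + len(SITES["XhoI"]):])

print("BamHI site at", bamhi_pos, "first codon:", forward_tail[:3])
print("XhoI site at", xhoi_pos, "last codons:", reverse_tail_sense[-6:])
```

Under these assumptions the forward primer begins the insert with GCT (an alanine codon, in line with Ala4231), and the sense-strand view of the reverse primer ends with GTG followed by the stop codon TAA (in line with Val4382 as the last residue).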
TABLE 1. Data collection and refinement statistics

Statistical element                            nsp10                           nsp10 in the absence of zinc

Data collection statistics
  Cell parameters                              a = 159.1 Å, b = 321.8 Å,       a = 159.7 Å, b = 322.7 Å,
                                               c = 161.9 Å, α = β = γ = 90°    c = 162.2 Å, α = β = γ = 90°
  Space group                                  C222₁                           C222₁
  Resolution (Å)                               50 (2.2)^c-2.1                  50 (2.8)-2.7
  No. of all reflections                       1,545,407                       845,526
  No. of unique reflections                    276,259                         114,838
  Completeness (%)                             99.5 (99.4)                     100.0 (99.9)
  Average I/σ(I)                               10.5 (4.2)                      6.6 (3.8)
  Rmerge^a (%)                                 6.1 (36.3)                      10.4 (40.1)

Refinement statistics
  No. of reflections used (σ(F) > 0)           468,890                         249,037
  Rwork^b (%)                                  21.9                            21.6
  Rfree (%)                                    24.7                            25.1
  RMSD bond distance (Å)                       0.013                           0.008
  RMSD bond angle (°)                          1.67                            1.79
  Average B value (Å²)                         31.8                            34.5

Ramachandran plot (excluding Pro and Gly)
  Res. in most favored regions                 2,039 (87.6%)                   1,932 (83%)
  Res. in additionally allowed regions         278 (12.0%)                     375 (16.1%)
  Res. in generously allowed regions           0                               18 (0.8%)

a  $R_{\mathrm{merge}} = \sum_h \sum_l |I_{hl} - \langle I_h \rangle| / \sum_h \sum_l \langle I_h \rangle$, where $\langle I_h \rangle$ is the mean of the observations $I_{hl}$ of reflection $h$.
b  $R_{\mathrm{work}} = \sum (||F_p(\mathrm{observed})| - |F_p(\mathrm{calculated})||) / \sum |F_p(\mathrm{observed})|$. $R_{\mathrm{free}}$ indicates the R factor for a selected subset (5%) of the reflections that was not included in prior refinement calculations.
c  Numbers in parentheses are corresponding values for the highest resolution shell.
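
The two residuals defined in the footnotes to Table 1 can be written out as a short sketch. The functions below follow the standard definitions of Rmerge and of the crystallographic R factor; the function names and the toy input values are hypothetical and serve only to show the arithmetic, not to reproduce the values in the table.

```python
def r_merge(observations):
    """Rmerge = sum_h sum_l |I_hl - <I_h>| / sum_h sum_l <I_h>.

    observations maps a reflection index h to the list of its
    repeated intensity measurements I_hl.
    """
    numerator = 0.0
    denominator = 0.0
    for intensities in observations.values():
        mean_i = sum(intensities) / len(intensities)
        numerator += sum(abs(i - mean_i) for i in intensities)
        denominator += mean_i * len(intensities)
    return numerator / denominator


def r_factor(f_obs, f_calc):
    """R = sum(| |F_obs| - |F_calc| |) / sum(|F_obs|).

    Gives Rwork when applied to the working set and Rfree when applied
    to the reflections held out of refinement.
    """
    if len(f_obs) != len(f_calc):
        raise ValueError("f_obs and f_calc must have the same length")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


# Toy data (hypothetical values, for illustration only).
toy_intensities = {(1, 0, 0): [100.0, 110.0], (0, 2, 0): [50.0, 50.0]}
toy_f_obs = [100.0, 80.0, 60.0, 40.0]
toy_f_calc = [90.0, 85.0, 50.0, 45.0]

toy_r_merge = r_merge(toy_intensities)
toy_r_work = r_factor(toy_f_obs, toy_f_calc)
print(f"toy Rmerge = {toy_r_merge:.3f}, toy R = {toy_r_work:.3f}")
```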


Crystallization. SARS nsp10-nsp11 crystals were grown at 291 K by the hanging
drop vapor diffusion method from a sodium chloride system in desalting
buffer supplemented with 65 mM zinc chloride. The optimum brick-like crystals
were obtained using a reservoir solution of 1.8 M NaCl, 0.1 M NaH₂PO₄, 0.1 M
KH₂PO₄, 0.1 M DTT, and 2-(N-morpholino)ethanesulfonic acid (MES; pH 6.5).
A single crystal was transferred to the reservoir solution (supplemented with 5 M
sodium formate) for 1 h prior to submersion in a cold nitrogen stream.

Data collection and processing. The “frozen” nsp10-nsp11 crystal diffracted to
2.1-Å resolution. Single-wavelength anomalous dispersion (SAD) data
using the bound zinc were collected for the native nsp10-nsp11 crystal at 100 K 
using an SBC2 (3,000 by 3,000) CCD detector on beamline BL19-ID at the 
Advanced Photon Source (Argonne National Laboratory). The intensity data 
were processed and scaled using the software HKL2000 (11). Data collection 
statistics are summarized in Table 1. 

Phasing, model building, and refinement. Forty-eight heavy-atom sites were
located with the program SHELX (16, 21); these were consistent with
the solution from a self-rotation function search using the program MOLREP
(22). Initial phases were calculated using the single-wavelength anomalous
dispersion (SAD) method with the program SOLVE (18). Twenty-two sites were
found to have occupancy greater than 50%, and the remaining sites had
occupancy greater than 30%. Density modification at 2.8-Å resolution by solvent
flipping was performed with the program CNS (3).

Initial tracing of the monomer polypeptide chain was performed manually
using the program O (7), and a starting dodecameric model was generated using
noncrystallographic symmetry (NCS) relations generated from CNS. Initial
refinement was performed using simulated annealing in CNS with very tight NCS
restraints. During the later stages of positional refinement, restraints were
relaxed and a bulk solvent correction was applied under the guidance of Rfree.
Model geometry was verified using the program PROCHECK (9). Solvent molecules
were located from stereochemically reasonable peaks in the σA-weighted
Fo - Fc difference electron density map, where Fo is the observed structure
factor amplitude and Fc is the calculated structure factor amplitude. Final
refinement statistics are shown in Table 1.

Protein structure accession numbers. Coordinates and structure factors for
the SARS-CoV nsp10 crystal structures have been deposited in the RCSB Protein
Data Bank (PDB) under accession numbers 2G9T (for the 2.1-Å structure)
and 2GA6 (for the 2.7-Å structure).


RESULTS AND DISCUSSION 


Overall structure. The cDNA coding for SARS-CoV nsp10-nsp11
was amplified using PCR. The encoded protein consists of
amino acid residues 4231 to 4382 of pp1a, which are renumbered
1 to 152 hereafter for convenience, and includes both
nsp10 and nsp11 regions. The recombinant protein was expressed
in E. coli as a GST fusion protein, purified, separated
from the GST tag, and crystallized in both the presence and
absence of additional zinc ion. Crystals grown under these two
conditions have the same morphology, and their crystal cell
parameters are similar to each other. Unless otherwise mentioned,
the higher resolution crystal (with supplementary zinc
salt) will be used in the following structural description. The
crystal form belongs to an orthorhombic space group with 24
copies of nsp10-nsp11 per crystallographic asymmetric unit and
a VM of 3.1 Å³ Da⁻¹, where VM is the Matthews coefficient.
The crystal structure was determined using SAD phasing from
the bound zinc ions. The structure model was refined at 2.1-Å
resolution to a final Rwork of 21.9% and Rfree of 24.7%.
Refinement statistics are detailed in Table 1. All 24
crystallographically independent copies of nsp10-nsp11 have very
similar structures, with all pair-wise Cα root mean square
deviations (RMSD) being less than 0.5 Å. Residues before
Ala9 (including five leading residues left from the tag),
residues Pro86-Gly88, and residues after Ser129 could not be traced
due to lack of interpretable electron density. Since the latter region
includes the entire nsp11 peptide, the crystallized protein is
nsp10 per se; it is called such hereafter. The absence of Cys130
and subsequent residues is a result of incidental cleavage, as
indicated by (i) the reduced molecular weight band from sodium
dodecyl sulfate polyacrylamide gel electrophoresis analysis
compared with that of a freshly prepared protein sample
and (ii) matrix-assisted laser desorption ionization-time of
flight mass spectrometry analysis (data not shown). The cause
of the loss of Cys130 and its C-terminal residues remains unclear.
However, analysis of the crystal packing suggests that the
C-terminal peptide deletion is required for the formation of
the crystal lattice.

FIG. 1. Ribbon representation of the nsp10 crystal structure. A.
One nsp10 trimer is viewed along the noncrystallographic threefold axis
from the outside of a dodecamer. The three protomers are colored in
magenta, gold, and green. B. The remaining three trimers of the
dodecamer are shown in the same orientation as that for panel A using
the same color scheme. C. The same as panel B but rotated by 90°. D.
The relationship between threefold axes. The twist angle between pairs
of threefold axes is approximately 108°. E. The complete dodecamer
structure of SARS-CoV nsp10. The three protomers in a trimer are
colored in magenta, gold, and green. This figure was prepared with the
programs Molscript (8), Bobscript (5), and Raster3D (10).
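
The Matthews coefficient quoted for the crystal can be related to the cell parameters by a short calculation. The sketch below is a rough consistency check, not part of the original analysis. It assumes the cell parameters listed first in Table 1 (a = 159.1 Å, b = 321.8 Å, c = 161.9 Å) belong to the 2.1-Å crystal, assumes eight asymmetric units per unit cell (the value for a C-centered orthorhombic space group of point group 222), and uses the common approximation that the solvent fraction is 1 - 1.23/VM. The protomer mass is back-calculated from VM because it is not stated in the text.

```python
# Cell parameters of the 2.1-A crystal as read from Table 1 (assumed assignment).
A_AXIS, B_AXIS, C_AXIS = 159.1, 321.8, 161.9  # angstroms; all angles are 90 degrees

ASU_PER_CELL = 8     # assumption: C-centered orthorhombic cell, point group 222
COPIES_PER_ASU = 24  # from the text
V_M = 3.1            # Matthews coefficient from the text, A^3 per Da

# Orthorhombic cell volume is simply the product of the three axes.
cell_volume = A_AXIS * B_AXIS * C_AXIS

# Crystal volume available to each protein copy.
volume_per_copy = cell_volume / (ASU_PER_CELL * COPIES_PER_ASU)

# V_M = volume per copy / mass per copy, so the implied mass is:
implied_mass_da = volume_per_copy / V_M

# Common approximation: protein fraction is about 1.23 / V_M.
solvent_fraction = 1.0 - 1.23 / V_M

print(f"cell volume       : {cell_volume:,.0f} A^3")
print(f"implied mass/copy : {implied_mass_da:,.0f} Da")
print(f"solvent fraction  : {solvent_fraction:.2f}")
```

Under these assumptions the implied mass per copy is close to 14 kDa and the solvent content is roughly 60%.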

The 24 independent copies of nsp10 assemble into two identical
spherical dodecamers related by a local twofold symmetry.
The novel dodecameric sphere possesses tetrahedral symmetry
and can be viewed as the assembly of four nsp10 trimers
(Fig. 1). The dodecamer is hollow in the center, with an
outer radius of 42 Å and an inner radius of 18 Å (Fig. 2). Two
zinc binding sites were identified in the three-dimensional
structure of each nsp10 monomer. The first one is located in
the N-terminal region and on the inner surface of the dodecamer,
and the second one is located at the C terminus on the
outer surface. The missing C tail of nsp10-nsp11 peptide would
also be located on the outer surface of the dodecamer. Both
inner and outer surfaces of the dodecamer have a dominantly
positive electrostatic potential (Fig. 2), although the calculated
pI of nsp10 is 5.8.
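
To illustrate why tetrahedral symmetry yields exactly 12 subunits arranged as four trimers, the following sketch builds the rotation group from one threefold and one twofold axis and counts its elements. It is an idealized geometric model, not the NCS operators of the crystal structure; the generator matrices and helper names are choices made here for illustration. For ideal tetrahedral symmetry the angle between threefold axes is arccos(-1/3), about 109.5°, close to the approximate value quoted in the legend to Fig. 1.

```python
import math

IDENTITY = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
# Threefold rotation about the (1, 1, 1) diagonal: (x, y, z) -> (z, x, y).
C3_111 = ((0, 0, 1), (1, 0, 0), (0, 1, 0))
# Twofold rotation about the z axis: (x, y, z) -> (-x, -y, z).
C2_Z = ((-1, 0, 0), (0, -1, 0), (0, 0, 1))


def matmul(a, b):
    """Product of two 3x3 integer matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3))
        for i in range(3)
    )


def apply(m, v):
    """Apply a 3x3 matrix to a 3-vector."""
    return tuple(sum(m[i][k] * v[k] for k in range(3)) for i in range(3))


def generate_group(generators):
    """Close a set of rotation matrices under multiplication."""
    group = {IDENTITY}
    frontier = [IDENTITY]
    while frontier:
        current = frontier.pop()
        for g in generators:
            candidate = matmul(g, current)
            if candidate not in group:
                group.add(candidate)
                frontier.append(candidate)
    return group


def element_order(m):
    """Smallest n such that m applied n times is the identity."""
    n, power = 1, m
    while power != IDENTITY:
        power = matmul(power, m)
        n += 1
    return n


def angle_deg(u, v):
    """Angle between two vectors in degrees."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return math.degrees(math.acos(dot / (norm_u * norm_v)))


group = generate_group([C3_111, C2_Z])

order_counts = {}
for m in group:
    n = element_order(m)
    order_counts[n] = order_counts.get(n, 0) + 1

# The four threefold axes, one per trimer, pointing from the center outwards.
THREEFOLD_AXES = [(1, 1, 1), (1, -1, -1), (-1, 1, -1), (-1, -1, 1)]
axes_are_threefold = all(
    any(element_order(m) == 3 and apply(m, axis) == axis for m in group)
    for axis in THREEFOLD_AXES
)
threefold_angle = angle_deg(THREEFOLD_AXES[0], THREEFOLD_AXES[1])

print(len(group), order_counts, round(threefold_angle, 2))
```

One subunit in a general position is copied once by each of the 12 rotations, and the eight threefold rotations group these copies into four trimers.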

The monomer fold. The full-length nsp10 polypeptide has a
mixed α/β fold composed of five α-helices (α1 to α5), one
3₁₀-helix, and three β-strands (β1 to β3) (Fig. 3). The monomer
peptide model consists of residues Ala9 to Ser129 and is
continuous, except that the region Pro86-Gly88 is missing in
the α4-β3 connecting loop. Nevertheless, in the 2.7-Å structure,
which was solved using crystals grown in the absence of additional
zinc, we were able to build the Pro86-Gly88 loop region
for some nsp10 monomers, suggesting that this region is flexible
in general. The central core of the nsp10 monomer is an
antiparallel β-sheet formed by strands β1 (residues 55 and 56),
β2 (65 to 69), and β3 (96 to 100). The central β-sheet is flanked
on one side by helices α3 (residues 70 to 73) and α4 (75 to 79),
while helices α1 (residues 10 to 18) and α2 (23 to 32) at the N
terminus, helix α5 (107 to 113), and the extended C-terminal
coil lie away from the central core. Residues on the α4 helix
and α4-β3 loop constitute the N-terminal zinc binding site, and
the C-terminal coil contributes to the C-terminal zinc binding
site. A DALI (http://www.ebi.ac.uk/dali/) search indicated no
similar match to the nsp10 monomer in the current Protein
Data Bank (PDB), suggesting a novel fold for nsp10.

nsp10 is cysteine-rich, with 13 cysteines in its sequence, 
namely Cys17, Cys41, Cys46, Cys73, Cys74, Cys77, Cys79, 
Cys90, Cys103, Cys117, Cys120, Cys128, and Cys130. Of these, 
Cys74, Cys77, and Cys90 form the N-terminal zinc binding site, 
while Cys117, Cys120, and Cys128 chelate the C-terminal zinc 
ion (Fig. 4). Cys130 is missing from the refined model due to 
the loss of the C-terminal peptide. Although Cys17-Cys46 and 
Cys41-Cys73 pairs are close in the three-dimensional structure, 
with Sγ-Sγ distances of 5.5 Å and 4.7 Å, respectively, they do
not form disulfide linkages. This observation is consistent with 
the presence of high concentrations of reducing agents in the 
crystallization reservoir. 

The trimeric assembly. Three monomers are related by a
noncrystallographic threefold symmetry to form a trimeric subunit
(Fig. 1A). The trimer is stabilized largely by hydrophobic
interactions. The buried solvent-accessible surface (SAS) of an
nsp10 monomer within a trimer is about 620 Å² (or about 9%
of the total SAS of an nsp10 monomer), with 65% contributed
from hydrophobic atoms. Residue Val21 in the α1-α2 loop of
one monomer forms hydrophobic interactions with residues
Val57 and Thr58 in strand β1 of an adjacent monomer; residues
Thr115 and Thr118 of one monomer also form hydrophobic
interactions with Thr118 and Val119 of an adjacent
monomer. Additional hydrogen bonds are formed between the
Nζ atom of Lys25 in one molecule and an Oε atom of Glu60
in an adjacent molecule, with a distance of 2.6 Å, and between
the main chain O atom of Pro84 in one molecule and the Nζ
atom of Lys95 in an adjacent molecule, with a distance of 3.3 Å.
Furthermore, the three monomers are oriented such that
their C-terminal zinc fingers are clustered around the threefold
axis, with their zinc ions separated by approximately 14 Å.

Trimer-trimer interactions and the dodecamer architecture. 
The assembly of the 24 copies of nsp10 into two identical 
FIG. 2. Electrostatic potential of the nsp10 crystal structure. A. The electrostatic potential of a complete dodecamer is mapped on its outer
molecular surface. Negatively charged regions are colored in red, positively charged regions are colored in blue, and neutral regions are colored
in gray. The C-terminal-bound zinc ions, which are located on the outer surface, are depicted as red spheres. The position of the front trimer is
highlighted by a triangle. B. The inner surface electrostatic potential of the dodecamer is shown in the same orientation as that of panel A. C. A
cross-section of the nsp10 dodecamer to illustrate the minimum and maximum radii of the shell. D. The electrostatic potential surface of an isolated
trimer. This figure was drawn with the program CCP4mg (12, 13).


dodecamers indicates that the dodecamer is a stable structural
unit of nsp10 under the crystallization conditions. The four
nsp10 trimers (named trimers 1 to 4) in a dodecamer are
related by a tetrahedral symmetry (Fig. 1). Any three trimers
are related by a local threefold axis, and any two trimers
are related by a local twofold axis. The buried SAS of a trimer
in the dodecamer is about
3,040 Å² (or about 16% of the total SAS of an nsp10 trimer),
with 69% contributed by hydrophobic atom groups. The crystal
structure shows that helix α1 and residues in the α2-β1 loop
(residues 42 to 46) play a key role in trimer-trimer interactions.
In the first (threefold symmetry-related) interaction region,
Leu14, Cys17, Ala18, and Cys79 of trimer 1 form a hydrophobic
base, which is directed towards Phe19 of trimer 2. At the
same time, Phe19 of trimer 1 forms hydrophobic interactions
with the equivalent hydrophobic base of trimer 3, and so on. In
the second (twofold symmetry-related) interaction region,
Met44 and Leu45 of trimer 1 are oriented to interact directly
with their counterparts in trimer 2; Val42 and Tyr96 of trimer
1 interact with Tyr96 and Val42 of trimer 2, respectively. Both
Cys17 and Cys79 contribute to stability of the dodecamer
architecture, although they do not form a disulfide bridge in the
reduced conformation state. Through these interactions, the 12
molecules can assemble to form a pseudododecahedron.
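
The buried-surface figures quoted for the trimer and the dodecamer can be cross-checked against each other with simple arithmetic. The sketch below is only a rough consistency check: it assumes that the quoted percentages are rounded, that 620 Å² is buried per monomer when the trimer forms, and that the trimer surface is approximately three times the exposed surface of a monomer in the trimer. The variable names are hypothetical.

```python
# Values quoted in the text.
MONOMER_BURIED = 620.0          # A^2 buried per monomer within a trimer
MONOMER_BURIED_FRACTION = 0.09  # about 9% of the monomer SAS
TRIMER_BURIED = 3040.0          # A^2 buried per trimer within the dodecamer
TRIMER_BURIED_FRACTION = 0.16   # about 16% of the trimer SAS

# Total SAS of an isolated monomer implied by the first pair of numbers.
monomer_total_sas = MONOMER_BURIED / MONOMER_BURIED_FRACTION

# Total SAS of an isolated trimer implied by the second pair of numbers.
trimer_total_from_fraction = TRIMER_BURIED / TRIMER_BURIED_FRACTION

# Independent estimate: three monomers, each losing 620 A^2 on trimerization.
trimer_total_from_monomers = 3.0 * (monomer_total_sas - MONOMER_BURIED)

relative_gap = (
    abs(trimer_total_from_fraction - trimer_total_from_monomers)
    / trimer_total_from_fraction
)

print(f"monomer SAS            : {monomer_total_sas:,.0f} A^2")
print(f"trimer SAS (from 16%)  : {trimer_total_from_fraction:,.0f} A^2")
print(f"trimer SAS (3 monomers): {trimer_total_from_monomers:,.0f} A^2")
print(f"relative gap           : {relative_gap:.1%}")
```

Under these assumptions the two estimates of the trimer surface agree to within a few percent, which is about what rounding of the quoted percentages would allow.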

The zinc ion binding sites. Zinc binding is a major structural
feature of nsp10. Spectral analysis of the nsp10 crystal during
synchrotron data collection clearly demonstrated the presence
of zinc, evident by a clear peak near the zinc absorption edge.
This allowed the nsp10 crystal structure to be solved by the SAD
method. Furthermore, the 2.7-Å resolution structure of nsp10
determined from crystals prepared in the absence of additional
zinc shows the same monomer fold, dodecameric architecture,
and conformation in the zinc binding sites, with an RMSD of
0.4 Å for all Cα atoms in a dodecamer.

Two bound zinc ions were identified from the crystal structure
of nsp10 with unambiguous electron density: one located
in the N-terminal region on the inner surface of the dodecamer
and the other one at the C-terminal region on the outer surface
(Fig. 2 and 4). The first zinc binding site is formed by residues
on helix α4 and the α4-β3 loop. This zinc ion is tetrahedrally
chelated by the Sγ atoms of three cysteine residues (Cys74,
Cys77, and Cys90) and an imidazole N atom of His83, which have bond
distances of 2.3 Å, 2.3 Å, 2.4 Å, and 2.2 Å, respectively (Fig.
4A; Table 2). This binding site constitutes a CCHC-type zinc
finger with a C-(X)2-C-(X)5-H-(X)6-C sequence motif.

FIG. 3. nsp10 monomer fold. A. Stereo diagram showing a Cα
trace of the nsp10 monomer. Two zinc ions are shown as gray spheres,
and one chelating water molecule is shown as a smaller red sphere. B.
Stereo ribbon diagram of the nsp10 monomer with β-strands in purple
and α-helices in gold. Secondary structure elements are labeled. This
view is rotated by ~90° relative to that of panel A. This figure was
prepared with the programs Molscript (8), Bobscript (5), and
Raster3D (10).

The second zinc ion is located at the C terminus and is also
tetrahedrally coordinated (Fig. 4B). Three zinc-chelating cysteine
residues (Cys117, Cys120, and Cys128) have distances of
2.4 Å from the zinc ion to their side chain Sγ atoms. The fourth
ligand is a water molecule, with clearly defined electron density,
whose distance to the zinc ion is 2.5 Å. This chelating
water molecule also interacts with Ser129 through a
hydrogen bond to form a stabilized hydrogen bond network.
All bond lengths related to zinc binding are summarized in
Table 2. Noteworthy is the observation that Cys130 neighbors
Ser129 in the nsp10 sequence but was absent from the crystallized
protein. Had it been present, Cys130 might be suitably
positioned to chelate the zinc ion, suggesting that the second zinc
binding site should also be a zinc finger, with a C-(X)2-C-(X)7-C-(X)-C
sequence motif. This CCCC-type zinc finger is distributed
on the surface of the dodecamer and is relatively
flexible in our structure compared to the CCHC-type zinc
finger (Table 2). However, the flexibility of this region may not
reflect the conformation of the full-length protein.
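
The two sequence motifs follow directly from the residue numbers of the zinc ligands. The short sketch below derives the spacer lengths from those numbers; the function name zinc_finger_motif is hypothetical, and Cys130 is included as the proposed fourth ligand of the second site even though it is absent from the crystallized protein.

```python
def zinc_finger_motif(ligands):
    """Build a motif string from (one-letter code, residue number) pairs.

    The ligands must be given in sequence order. A spacer of one residue
    is written as (X), longer spacers as (X)n.
    """
    parts = [ligands[0][0]]
    for (_, previous_number), (code, number) in zip(ligands, ligands[1:]):
        gap = number - previous_number - 1  # residues between two ligands
        spacer = "(X)" if gap == 1 else f"(X){gap}"
        parts.extend([spacer, code])
    return "-".join(parts)


# Ligand residues as listed in the text.
N_TERMINAL_SITE = [("C", 74), ("C", 77), ("H", 83), ("C", 90)]
# Cys130 is the proposed, not observed, fourth ligand.
C_TERMINAL_SITE = [("C", 117), ("C", 120), ("C", 128), ("C", 130)]

n_motif = zinc_finger_motif(N_TERMINAL_SITE)
c_motif = zinc_finger_motif(C_TERMINAL_SITE)
print(n_motif)
print(c_motif)
```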

A multiple-sequence alignment of nsp10 from SARS-CoV 
with coronaviruses from groups I, II, and III of the genus 
Coronavirus indicates that all seven observed zinc-chelating 
residues, plus Cys130, are strictly conserved (Fig. 5), implying 
their importance in the functional replicase-transcriptase com- 
plex. In contrast, a number of other cysteine residues in SARS 
nsp10 are not conserved at all. 


FIG. 4. Stereo views of the two independent zinc binding sites. A. 
The N-terminal zinc binding site with electron density. Key residues 
interacting with the zinc ion are labeled and depicted as sticks with 
carbon, nitrogen, oxygen, and sulfur atoms colored in yellow, blue, red, 
and green, respectively. The zinc ion is shown as a gray sphere. A 
refined-model-phased 2Fo-Fc electron density map is shown for the
zinc ion and chelated residues at 1.3 σ. Secondary structure elements
are labeled. B. The C-terminal zinc binding site with electron density. 
The turn between C117 and C120 is labeled. This figure was prepared 
with the programs Molscript (8), Bobscript (5), and Raster3D (10). 


Functional implications. The finding of zinc binding sites in
nsp10 has strong implications for the biological function of nsp10.
The zinc finger is a common DNA binding motif found
in transcription-regulatory proteins, which often
include more than one zinc finger in the same peptide chain.
Moreover, increasing evidence suggests that zinc fingers are
used more widely, to recognize RNA or even to mediate
protein-protein recognition.

Although the DALI search did not find any candidate struc- 
ture similar to SARS-CoV nsp10 from PDB, a PFAM search 
(http://www.sanger.ac.uk/Software/Pfam) for similar sequence 
motifs identified several members of the HIT-type zinc finger 
family as nsp10 homologous candidates. Named after the first 
protein that originally defined the domain, the yeast HIT1 


TABLE 2. Zinc chelation

Zinc ion   Chelating residue   Chelating atom   Bond length^a (Å)
Zn 1       Cys74               Sγ               2.29 ± 0.04
           Cys77               Sγ               2.27 ± 0.04
           His83               N (imidazole)    2.18 ± 0.05
           Cys90               Sγ               2.40 ± 0.03
Zn 2       Cys117              Sγ               2.41 ± 0.05
           Cys120              Sγ               2.45 ± 0.15
           Cys128              Sγ               2.38 ± 0.11
           Wat997              H2O              2.51 ± 0.07
Ser129     Wat997              H2O              2.76 ± 0.16

a  The listed bond length is an average value over the 24 monomers with
standard deviations.
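
The spread of the bond lengths in Table 2 gives a simple numerical view of the relative flexibility of the two zinc sites. The sketch below averages the listed bond lengths and standard deviations for each site; it assumes that the second number in each table entry is the standard deviation over the 24 monomers, as the footnote indicates. Averaging the standard deviations is a crude summary chosen here for illustration, not a statistic used in the original analysis.

```python
from statistics import mean

# (bond length, standard deviation) in angstroms, as listed in Table 2.
BONDS = {
    "Zn 1 (CCHC)": [(2.29, 0.04), (2.27, 0.04), (2.18, 0.05), (2.40, 0.03)],
    "Zn 2 (CCCC)": [(2.41, 0.05), (2.45, 0.15), (2.38, 0.11), (2.51, 0.07)],
}

summary = {}
for site, entries in BONDS.items():
    lengths = [length for length, _ in entries]
    spreads = [sd for _, sd in entries]
    # Store (mean bond length, mean standard deviation) for the site.
    summary[site] = (mean(lengths), mean(spreads))

for site, (mean_length, mean_sd) in summary.items():
    print(f"{site}: mean length {mean_length:.2f} A, mean SD {mean_sd:.3f} A")
```

With the tabulated values the mean standard deviation at the second site is roughly twice that at the first, in keeping with the greater variability noted for the C-terminal zinc finger.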
FIG. 5. Multiple-sequence alignment of SARS-CoV nsp10 with representatives from all three groups of the genus Coronavirus. HCoV-229E,
human coronavirus strain 229E; MHV, mouse hepatitis virus; IBV, avian infectious bronchitis virus. The secondary structure for SARS-CoV is
shown at the top of the alignment; arrows indicate β-strands, and helical curves denote α- or 3₁₀-helices. Residues highlighted in red are identical
among the compared proteins; residues highlighted in yellow are conserved. Residues important for zinc binding are marked with green triangles,
and residues important for stability of the dodecamer are marked with blue vertical arrows. The alignment was generated by ClustalX (20) and
drawn with ESPript (6).


protein, the HIT-type zinc finger contains seven conserved 
cysteines and one histidine that can potentially coordinate two 
zinc atoms. While the function of the HIT-type zinc finger is 
unknown, this motif is mainly found in nuclear proteins in- 
volved in gene regulation and chromatin remodeling. To date, 
there are no three-dimensional structures of HIT-type zinc 
finger domains reported in the PDB. Therefore, our nsp10 
crystal structure serves as the first example of a three-dimen- 
sional structure of this novel class of double zinc finger-con- 
taining motif. 

This finding provides strong structural evidence that, like 
other better studied nsp proteins, nsp10 also likely plays a role 
in RNA synthesis, as suggested by other researchers (26). 
nsp10 is involved in network interactions with other nsp pro- 
teins, and the integrity of its zinc fingers seems important for 
such interactions. Experiments on mouse hepatitis virus 
(MHV), a group II coronavirus along with SARS-CoV, dem- 
onstrated the colocalization of nsp10 with nsp7, nsp8, and 
nsp9, providing solid evidence for their interaction in the coro- 
navirus life cycle (2). Our chemical cross-linking experiment 
further demonstrated that SARS-CoV nsp10 can be cross- 
linked with nsp9 (data not shown), which itself interacts with 
nsp8 (17). Furthermore, an MHV ts mutant, Alb ts6, encoding 
a mutant form of nsp10 with a Gln65-to-Glu mutation, was 
shown to have a defect in negative-strand RNA synthesis (15). 
The Gln65 residue, conserved in all three groups of the genus 
Coronavirus, is located on strand β2 of the SARS nsp10 structure
and hydrogen bonds via the Nε2 atom to the main chain
carbonyl oxygen of Gly52. Gln65 is thus important for the
conformational stability of nsp10 and particularly for the α4
helix, which forms part of the N-terminal CCHC zinc finger.
Therefore, mutation of Gln65 might be expected to perturb the 
folding of pp1a into a less productive conformation that would 
prevent it from participating in the formation of a replicase- 
transcriptase complex with negative-strand activity. 

Conclusions. The scientific significance of the SARS-CoV 
nsp10 structure is at least threefold. First, nsp10 has a novel 
protein fold. A search with the DALI web engine
(http://www.ebi.ac.uk/dali/) for structural homologs failed to yield any
match to the nsp10 fold, suggesting a novel function for the
nsp10 family members. Second, nsp10 possesses two zinc fingers,
located in the N-terminal region and at the C-terminal
region, with C-(X)2-C-(X)5-H-(X)6-C and C-(X)2-C-(X)7-C-(X)-C
sequence motifs, respectively. These motifs are conserved
in all three groups of the genus Coronavirus, and our
crystal structure illustrates for the first time the significance of 
these conserved residues. Further sequence analysis suggests 
that nsp10 is related to the HIT-type zinc finger family, which 
is often found in nuclear proteins involved in gene regulation 
and chromatin remodeling. Thus, our nsp10 crystal structure 
becomes the first of a new class of zinc finger protein three- 
dimensional structures to be revealed experimentally. Third, 
the molecular assembly of nsp10 is a hollow dodecamer with an 
outer diameter of 84 Å and an inner diameter of 36 Å. Twelve
C-terminal zinc fingers stick out from the outer surface of the 
sphere, and another 12 zinc fingers are distributed around the 
inner surface. The strong positive electrostatic potential found 
on both the inner and outer surfaces of the dodecamer is 
intriguing, consistent with the probable function of nsp10 in 
the RNA synthesis machinery. 

To date, the structures and functions of several components 
of the replication/transcription machinery have been determined,
including the nsp3 ADP-ribose 1″-phosphatase domain
(14); nsp5 (23, 24); nsp7 and nsp8 in complex (25); and nsp9
(4, 17). nsp5 is the main protease for cleavage of the replicase
polyproteins, nsp7 and nsp8 are proposed to function as pro- 
cessivity factors for nsp12 (the RNA-dependent RNA poly- 
merase), and nsp9 is a single-strand RNA binding protein. The 
crystal structure reported here will help to clarify the func- 
tion(s) of nsp10, in which the presence of two zinc fingers 
should enable it to play an important role in RNA synthesis. 
Elucidation of the nsp10 structure will provide further insights 
into the sophisticated replication/transcription mechanism of 
SARS-CoV and other coronaviruses, such as mouse hepatitis 
virus (MHV), human coronavirus strain 229E, and human
coronavirus strain HKU1. 